Two-pass decision tree construction for unsupervised adaptation of HMM-based synthesis models

Author

Matthew Gibson

Abstract

Hidden Markov model (HMM)-based speech synthesis systems possess several advantages over concatenative synthesis systems. One such advantage is the relative ease with which HMM-based systems are adapted to speakers not present in the training dataset. Speaker adaptation methods used in the field of HMM-based automatic speech recognition (ASR) are adopted for this task. In the case of unsupervised speaker adaptation, previous work has used a supplementary set of acoustic models to first estimate the transcription of the adaptation data. By defining a mapping between HMM-based synthesis models and ASR-style models, this paper introduces an approach to the unsupervised speaker adaptation task for HMM-based speech synthesis models which avoids the need for supplementary acoustic models. Further, this enables unsupervised adaptation of HMM-based speech synthesis models without the need to perform linguistic analysis of the estimated transcription of the adaptation data.

Similar resources

Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis

In the EMIME project, we are developing a mobile device that performs personalized speech-to-speech translation such that a user’s spoken input in one language is used to produce spoken output in another language, while continuing to sound like the user’s voice. We integrate two techniques, unsupervised adaptation for HMM-based TTS using a word-based large-vocabulary continuous speech recognizer...

Some Aspects of ASR Transcription Based Unsupervised Speaker Adaptation for HMM Speech Synthesis

Statistical parametric synthesis offers numerous techniques to create new voices. Speaker adaptation is one of the most exciting ones. However, it still requires high quality audio data with low signal-to-noise ratio and precise labeling. This paper presents an automatic speech recognition based unsupervised adaptation method for Hidden Markov Model (HMM) speech synthesis and its quality evalu...

A decision tree-based clustering approach to state definition in an excitation modeling framework for HMM-based speech synthesis

This paper presents a decision tree-based algorithm to cluster residual segments assuming an excitation model based on state-dependent filtering of pulse train and white noise. The decision tree construction principle is the same as the one applied to speech recognition. Here parent nodes are split using the residual maximum likelihood criterion. Once these excitation decision trees are construc...

Decision Tree-Based Clustering with Outlier Detection for HMM-Based Speech Synthesis

In order to express natural prosodic variations in continuous speech, sophisticated speech units such as the context-dependent phone models are usually employed in HMM-based speech synthesis techniques. Since the training database cannot practically cover all possible context factors, decision tree-based HMM states clustering is commonly applied. One of the serious problems in a decision tree-bas...

Phonological Knowledge Guided HMM State Mapping for Cross-Lingual Speaker Adaptation

Within the HMM state mapping-based cross-lingual speaker adaptation framework, the minimum Kullback-Leibler divergence criterion has been typically employed to measure the similarity of two average voice state distributions from two respective languages for state mapping construction. Considering that this simple criterion doesn’t take any language-specific information into account, we propose ...

Publication date: 2009
